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ABSTRACT 

Surface topography refers to the geometric micro-structure 
of a surface and defines its tactile characteristics (typically in 
the sub-millimeter range). High-resolution 3D scanning tech¬ 
niques developed recently enable the 3D reconstruction of 
surfaces including their surface topography. In this paper, we 
present an efficient image-space technique for the extraction 
of surface topography from high-resolution 3D reconstruc¬ 
tions. Additionally, we filter noise and enhance topographic 
attributes to obtain an improved representation for subse¬ 
quent topography classification. Comprehensive experiments 
show that the our representation captures well topographic at¬ 
tributes and significantly improves classification performance 
compared to alternative 2D and 3D representations. 

Index Terms — 3D surface analysis, surface micro¬ 
structure, topography classification 

1. INTRODUCTION 

Methods for sparse and dense 3D scene reconstruction have 
progressed strongly due to the availability of inexpensive, off- 
the-shelf hardware (e.g. Microsoft Kinect) and robust recon¬ 
struction algorithms (e.g. structure from motion techniques, 
sfM) mia. The result of these techniques is a heavily in¬ 
creased amount of available 3D data. Novel techniques enable 
dense 3D reconstructions at sub-millimeter resolution which 
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accurately capture a surface’s topography 0. This opens up 
new opportunities for search and retrieval in 3D scenes, such 
as the recognition of objects by their surface properties as 
well as the distinction of different types of materials for im¬ 
proved scene understanding. The tremendous amount of data 
in such high-resolution reconstructions, however, requires ef¬ 
ficient processing methods. 

The appearance of a 3D surface can be seen as the compo¬ 
sition of the visual appearance (visual texture, image texture) 
and the tactile appearance (surface texture) aa. Visual ap¬ 
pearance refers to color variations, brightness, and reflectiv¬ 
ity. Surface texture refers to the geometric micro-structure of 
a surface in terms of roughness, waviness, and lay laa. It 
is defined as the repetitive random deviation from the “ideal” 
surface. This deviation forms the three dimensional topogra¬ 
phy of a surface [(3 . Compared to the visual appearance of 
a surface, its topography is invariant to lighting and viewing 
conditions and thus a robust basis for its description. 

Related research has mainly focused on the description 
and classification of surfaces in terms of their visually appar¬ 
ent texture, fostered by the broad availability of image texture 
datasets iiiiiiia. Texture classification is strongly related to 
the description of surface topography, it is, however, highly 
sensitive to different lighting conditions. Thus, a major goal 
in texture classification is the illumination-invariant modeling 
of texture from one or more provided input images. In the 3D 
domain the situation is fundamentally different. Illumination 
becomes negligible, as the entire surface geometry is known. 
The major challenge becomes the extraction of the surface 
micro-structure from 3D data and the robust description of its 
attributes. 

For the representation of surface topography, descriptors 
are required that capture the local geometry around a given 
point. A large number of local 3D descriptors has been devel¬ 
oped for this purpose, such as 3D Shape Context (3DSC) ifTlI 
and Point Feature Histograms (PFH) lfT3 . Their major limi¬ 
tation is, however, the high computational cost of their dense 
extraction from an entire point cloud which is necessary to 
capture the topography continuously across a surface. An al- 
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ternative approach to represent surface topography is to ex¬ 
tract high-frequency information from the point cloud in 3D 
and to map this information to 2D by non-linear dimension¬ 
ality reduction ca. The non-linear mapping is, however, a 
global operation on the entire point cloud which is infeasi¬ 
ble for point clouds with several million of points. To enable 
the efficient extraction of topography from high-resolution re¬ 
constructions, we propose an alternative approach: Instead of 
extracting local 3D descriptors densely over an entire point 
cloud or applying global operations on the point cloud, we 
propose an image-space method that first maps the surface 
to 2D and then reconstructs and enhances the topography ef¬ 
ficiently in the image domain. The result is a 2D topogra¬ 
phy map which is well-suited for further processing. The 
approach is computationally efficient and thus applicable to 
large-scale data. 

In Section [2] we describe our approach for topography ex¬ 
traction. SectionOpresents the experimental setup and results 
on a set of high-resolution surfaces. We draw conclusions and 
point out future work in Section [H 

2. PROPOSED APPROACH 

We first present topography extraction from 3D data and then 
enhance topographic attributes for topography classification. 

2.1. Image-space topography extraction 

The input to our approach is a high-resolution point cloud 
with P points. In a first step, we estimate a support plane 
for the cloud that minimizes the least squares distances to the 
plane. Next, we estimate the location of each 3D point on 
the support plane by an orthographic projection with projec¬ 
tion direction set to the normal direction of the support plane. 
We map the signed distances between the 3D points and the 
support plane to the respective locations. The result is a 2D 
depth map of the 3D surface. Figure [T] illustrates the pro¬ 
cess. Figures and [T(b^ show the original point cloud in 

3D and from projection direction. The depth map is shown 
in Figure |l(c)| It most prominently shows the global depth 
variations related to the curvature of the surface. The local 
micro-structure is hardly visible. 

The extraction of the surface topography from the depth 
map requires the compensation of the global curvature. We 
first smooth the depth map to obtain the global depth dis¬ 
tribution of the underlying 3D surface by convolution with 
a two-dimensional Gaussian-shaped filter G(x, y) with stan¬ 
dard deviation a and a support of pixels. The filter is 
applied to all possible locations (x^y) of the original depth 
map D by convolution. The result D{x, y) is an estimate of 
the local average at all surface locations. We subtract the local 
average from the depth map at every location hy T{x^y) = 
D{x,y) — D{x,y). The result is a compensated depth map T, 
the topography map, which is free of global geometric vari- 



(a) original point cloud (b) point cloud viewed from projec¬ 

tion direction 



(c) projected depth map (d) topography map T 


Fig. 1. Extraction of surface topography: (a) the 3D point 
cloud; (b) the point cloud viewed from projection direction; 
(c) the depth projection of the cloud; (d) the extracted topog¬ 
raphy map. 


ations. The topography map captures well the local micro¬ 
structures of the surface (e.g. crannies and fine surface irreg¬ 
ularities) that are not recognizable from the depth map, see 
Figure |l(d)| 

As most surfaces are in general not developable (non¬ 
zero Gaussian curvature) they cannot be flattened onto a plane 
without a certain amount of distortions. Depending on the 
precision required by the given application this may set a 
limit to the amount of global curvature contained in the sur¬ 
face. Since topographic attributes have local spatial support, 
a straight-forward approach to limit distortions is to split the 
surface into quasi-linear patches so that the overall curvature 
per patch remains limited. 

The computation of the topography map is computation¬ 
ally efficient. The orthogonal projection can be performed in 
linear time 0{P) with respect to the number of points P in 
the input point cloud. The convolution takes 0{Nk), where 
N is the number of pixels in the projected image and k the 
number of pixels of the kernel. The remaining operations are 
in 0{N). Thus, the method is linear in P and N. 

2.2. Enhancement of topographic attributes 

The topography map T is signed. The sign encodes whether 
a projected point is below or above the smoothed average sur¬ 
face and thus enables the distinction between peaks and val¬ 
leys in the surface 

Peaks and valleys are important building blocks of the sur¬ 
face topography and are strongly related to the topographic 
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attributes roughness and lay CHISl. To obtain expressive rep¬ 
resentations for subsequent topography analysis, we extract 
two maps that capture the spatial distributions of peaks and 
valleys, respectively. Additionally, we filter noise present in 
the map that originates from data capturing and 3D recon¬ 
struction (sensor noise, outlier points, and noise at depth dis¬ 
continuities). 

Positive values in the topography map indicate surface 
points below the smoothed surface (valleys) while negative 
values represent local peaks. We first split the topography 
map T into a positive and a negative part, and by: 

T'’(x, y) = max{f{x, y), 0) (1) 

fPix,y) = \min{f{x,y),0)\, (2) 

where contains the positive portion of T which represents 
the map of the local valleys and captures the negative por¬ 
tion and provides the map of the local peaks. Next, we locally 
smooth the image with a Gaussian filter G (as defined in Sec¬ 
tion 12.11) to remove noise and discontinuities introduced by 
cropping in Equations [T] and O As a result we obtain two 
filtered maps TJ = * G and * G. 

In areas where the surface has strong irregularities in 
depth (e.g. along cracks and crannies), outliers are often ob¬ 
tained during 3D reconstruction. The outliers bias the value 
range of the maps and skew the distribution. To reduce their 
influence (and at the same time to avoid their explicit detec¬ 
tion), we take the logarithm of both maps: E'^ = log{Tg) and 
E'P = logiT'^). Logging compensates the outliers’ influence 
and makes the value distribution approximately Gaussian. As 
a result the topography maps of peaks and valleys, E^ and 
E^, are improved. 

3. EXPERIMENTS AND RESULTS 

We first demonstrate the effect of topography extraction and 
enhancement qualitatively on an example surface and then 
evaluate the capabilities of the proposed representations quan¬ 
titatively by topography classification experiments. 

3.1. Dataset 

There is currently no publicly available dataset that provides 
high-resolution 3D reconstructions of surfaces together with 
ground-truth annotations of their surface topography. A re¬ 
search domain that intensively analyzes surface properties is 
archeology. Archeologists generate high-resolution 3D re¬ 
construction to represent findings and artifacts. For our ex¬ 
periments we employ a set of high-resolution 3D reconstruc¬ 
tions of natural rock surfaces made by archeologists. The 
rock surfaces exhibit human-made engravings (so called rock 
art) which exhibit a different surface topography than the sur¬ 
rounding rock surface. As there additionally exists ground- 
truth for the engravings, the data is well-suited for topogra¬ 


phy classification and thus to evaluate the capabilities of our 
representations. 

The employed dataset contains 4 high-resolution surface 
reconstructions with a total number of 12.3 • 10^ points. The 
resolution is below 0.1mm in X,Y, and Z direction. For each 
surface a precise ground truth exists that labels the natural 
rock surface (class 1) and all engravings (class 2). Class 2 rep¬ 
resents only 16.6% of the data and is thus underrepresented. 

3.2. Qualitative evaluation 

We demonstrate the effect of topography extraction and en¬ 
hancement on an example surface from our dataset. First, we 
specify the size of Gaussian filter G. The size of G is a de¬ 
sign parameter that specifies at which granularity topographic 
structures should be enhanced. As we want to enhance topo¬ 
graphic structures related to engravings on the rock surfaces, 
we set G to approximately the size of an individual engraving 
(approx. 4mm, which refers to W=62 pixels). Figure^ shows 
the result of topography extraction for a rock surface with a 
humanoid-shaped engraved figure. Figure |2^ shows the vi¬ 
sual appearance (image texture) of the surface obtained with 
oblique shading. The depth map in Figure [2^ clearly shows 
the global curvature of the surface (along the horizontal di¬ 
rection). At the same time, the depth map superimposes the 
fine surface details. The topography map in Figure [^^com¬ 
pensates the global curvature of the surface and captures well 
the surface micro-structure. Figure [2(d)| shows the enhanced 
map E^. Areas in E'^ with high energy (reddish color) indi¬ 
cate valleys in the surface. The valleys captured by E'^ cor¬ 
respond well with valleys in the surface which originate from 
crannies and engravings. This shows that the map models the 
desired topographical information. Topographic structures at 
finer and coarser granularity can be extracted by adapting W. 

3.3. Quantitative evaluation 

To evaluate the suitability of the proposed representation we 
perform experiments on surface topography classification us¬ 
ing the dataset from Section [3T] As a baseline for comparison 
we employ (i) a full 3D appoach based on local 3D descriptors 
and (ii) several commonly used 2D representations as alter¬ 
native to our enhanced topography maps (ETM). For the 3D 
approach we perform a dense analysis of the surfaces with 
four local 3D descriptors (PFH CSl, FPFH Cll, SHOT CSl, 
and 3DSC CD) and apply classification directly on the 3D 
descriptors. 

For the 2D representations we consider the task of topog¬ 
raphy classification as a texture classification task on the re¬ 
spective representation. We compare the classification per¬ 
formance of our map with that of (i) the visual appearance 
of the surface (visual texture), (ii) the depth map (pure depth 
information), and (iii) the depth gradient map (DGM). DGM 
is the local gradient of the depth map. The DGM empha¬ 
sizes local fine structures and neglects global depth variations 
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(c) topography map 

Fig. 2. A 3D reconstruction of a rock surface in which the 
shape of a human figure has been engraved. The enhanced 
topography map in this example) accentuates well the 
valleys corresponding to the engravings and makes it easy to 
distinguish the engraving from the surrounding rock surface. 


and has been successfully employed in ifTTll . To capture the 
surface topography from the 2D representations, we extract 
the following baseline features (in a block-wise manner): Lo¬ 
cal Binary Patterns (LBP) SIFT descriptors (191, His¬ 
tograms of Oriented Gradients (HOG) |[20l . and Gray-Level 
Co-occurrence Matrices (GLCM). 

Additionally, we propose two easy to compute features for 
the description of the ETM: (i) global histogram shape (GHS) 
which contains the 30 first low-frequency DCT coefficients 
of the global histogram of an image block and (ii) spatial fre¬ 
quencies (SF) which contains the first 8x8 low frequency 2D 
DCT coefficients of a given image block. 

For classification we employ Random Undersampling 
Boosting (RUSBoost) which is especially designed for imbal¬ 
anced class distributions ED. We split the dataset into train¬ 
ing and test surfaces and perform 10-fold cross-validation 
on the training set. We compute recall and precision for 
both classes and report the fl score of the underrepresented 
class for each experiment which is most expressive to assess 
the overall performance. We apply Fisher’s randomization 
test II 22 II and Student’s paired t-test with a significance 
level of 0.05 to judge performance differences. 

Figure[3] shows boxplots of the classification results for all 
employed 2D and 3D representations and features. The 3D 
baseline obtained from dense 3D descriptors (“Dense 3D”) 
yields weak results compared to the 2D representations (four 
leftmost results in FigureO with a maximum fl of 44.1% for 
PFH. The 3D descriptors do not seem to discriminate well 
between the two topological classes. 

Using purely visual information (image texture of the or¬ 
thographic image) does also not solve the task. Results on the 



3D representations and features. Our proposed representation 
(ETM) significantly outperforms all other representations. 


color image are weaker than that of the other 2D representa¬ 
tions, with a maximum fl score of 51.7% for LBR This shows 
that color information alone is not sufficient for the classifica¬ 
tion of surface topography. We observe that the incorporation 
of 3D information in terms of depth (in the depth map and 
in DGM) partly improves results significantly compared to 
color. The best results on DGM are obtained by LBP (60.2%) 
and on the depth map by HOG (67.9%), which is at the same 
time the best result obtained by a baseline method. 

Experiments on the enhanced topography map (7 right¬ 
most results on Figure [3]) show that typical texture features 
such as LBP and HOG cannot benefit from our representa¬ 
tion (fl < 50%). GLCM, however, strongly benefits and 
significantly outperforms the other features evaluated so far 
on the ETM (fl = 65.7). A further gain in performance is 
only obtained by the proposed features (GHS and SF) which 
both yield an fl of 69.0%. GHS as well as SF on ETM sig¬ 
nificantly outperform all other features (p-value<0.01). Peak 
performance in our experiments is obtained for ETM with the 
combination of GHS and SF (fl = 74.4%, p-value <0.01). 

We assume that the strongly varying performance of the 
different features on ETM has several reasons. The ETM is 
a comparably smooth representation (due to Gaussian filter¬ 
ing). Thus, features that rely on gradients, such as HOG can 
hardly benefit from it. Similarly LBP lacks expressiveness. 
GLCM benefits most, since it takes absolute values as input 
and does not rely on relative differences like HOG and LBR 
We observe the same behavior for GHS and SE which rely on 
absolute values, as well. The high performance of the com¬ 
bined features (GHS-fSE) has two reasons: firstly, both fea¬ 
tures perform well when applied separately; secondly, both 
features represent complementary information: GHS repre¬ 
sents the value distribution globally while SE captures the 
spatial value distribution. 
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4. CONCLUSION 

We have presented an efficient image-space method for the 
extraction of surface topography from high-resolution 3D 
reconstructions. Additionally, we have proposed enhanced 
topography maps as a novel representation for improved to¬ 
pography classification. Our evaluation demonstrates that 
the proposed maps are well-suited for topography representa¬ 
tion and significantly outperform alternative representations 
in classification experiments, such as local 3D descriptors, 
depth maps, and depth gradient maps. Furthermore, experi¬ 
ments show that pure 2D information (image texture) is not 
able to capture the full surface topography. In future, we will 
extend topography maps to multiple scales, to obtain a rep¬ 
resentation that captures topographic structures at different 
granularities. 
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